Power Log’n’Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols
نویسندگان
چکیده
In fault tolerance for parallel and distributed systems, message logging protocols have played a prominent role in the last three decades. Such enable local rollback to provide recovery from fail-stop errors. Global techniques can be straightforward implement but at times lead slower than rollback. Local is more complicated offer faster times. this work, we study power energy efficiency implications of global We propose power-efficient version reduce consumption non-critical, blocked processes, using xmlns:xlink="http://www.w3.org/1999/xlink">Dynamic Voltage Frequency Scaling (DVFS) xmlns:xlink="http://www.w3.org/1999/xlink">clock modulation (CM). Our results 3 different MPI codes on 2 systems show that reduces CPU waste up 50% during phase, compared existing techniques, without introducing significant overheads. Furthermore, savings manifest all blocked which grow linearly with process count. estimate settings high overheads total reduced proposed
منابع مشابه
Efficient Message Logging for Uncoordinated Checkpointing Protocols
HAL is a multidisciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt età la diffusion de documents scientifiques de niveau r...
متن کاملUsing Message Semantics to Reduce Rollback in Optimistic Message Logging Recovery Schemes
Recovery from failures can be achieved through asyn-chronous checkpointing and optimistic message logging. These schemes have low overheads during failure-free operations. Central to these protocols is the determination of a maximal consistent global state, which is recoverable. Message semantics is not exploited in most existing recovery protocols to determine the recoverable state. We propose...
متن کاملA Fast Rollback-Recovery Scheme based on Optimistic Message Logging
This paper presents an eecient rollback recovery scheme based on the optimistic message logging. To speed up the recovery process, the rollback point of the failed process is broadcast and other processes asynchronously make the rollback decision based on the vector time. Asynchronous recovery process usually causes two possible problems: One is the message delivered from an invalid state inter...
متن کاملFlexible Power Electronic Transformer for Power Flow Control Applications
This paper proposes a Flexible Power Electronic Transformer (FPET) for the application in the micro-grids. The low frequency transformer is usually used at the Point of Common Coupling (PCC) to connect the low voltage grid and utility network to each other. The conventional 50Hz transformer results in enhanced low voltage-grid power management system during grid-connected operation. In this pap...
متن کاملConsistent Rollback Protocols for Autonomic ASSISTANT Applications
Nowadays, a central issue for applications executed on heterogeneous distributed platforms is represented by assuring that certain performance and reliability parameters are respected throughout the system execution. A typical solution is based on supporting application components with adaptation strategies, able to select at run-time the better component version to execute. It is worth noting ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Parallel and Distributed Systems
سال: 2022
ISSN: ['1045-9219', '1558-2183', '2161-9883']
DOI: https://doi.org/10.1109/tpds.2021.3107745